Sparse Separation of Under-Determined Speech Mixtures

نویسندگان

  • Paul D. O’Grady
  • Barak A. Pearlmutter
چکیده

We are all familiar with the shape of sound from our secondary school science classes; the typical oscillatory form of a string under tension that decays over time is widely know. At first sight, this representation of sound imparts to the observer nothing more than its duration and amplitude. So how does the brain separate different sounds given such a representation? Over millions of years the Mammalian auditory cortex has evolved to effectively understand the sounds of its natural environment. By learning the distinguishing features of a sound, the brain can recognise and classify many different sounds. It has long been desired to replicate this ability using a machine, which has been the genesis for a topic of study known as source separation. In this thesis we utilise two contrasting strategies for the Separation of under-determined speech mixtures, i.e., the case where there are more sources than mixtures. Furthermore, we impose a sparseness requirement on the sources. First, we introduce a blind source separation method called the LOST algorithm, which is based on a Expectation-Maximisation procedure. The LOST algorithm assumes an instantaneous mixing model, and estimates the columns of the mixing matrix by identifying corresponding linear subspaces in a scatter plot. This method combined with a transformation into a sparse domain and an L1-norm minimisation, constitutes a blind source separation algorithm for the under-determined case, where there are at least two mixtures. Second, we investigate Convolutive Non-negative Matrix Factorisation,

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Underdetermined Model-Based Blind Source Separation of Reverberant Speech Mixtures using Spatial Cues in a Variational Bayesian Framework

In this paper, we propose a new method for underdetermined blind source separation of reverberant speech mixtures by classifying each time-frequency (T-F) point of the mixtures according to a combined variational Bayesian model of spatial cues, under sparse signal representation assumption. We model the T-F observations by a variational mixture of circularly-symmetric complex-Gaussians. The spa...

متن کامل

Underdetermined Anechoic Blind Source Separation

In this paper, we address the problem of under-determined Blind Source Separation (BSS) of anechoic speech mixtures. We propose a demixing algorithm that exploits the sparsity of certain time-frequency expansions of speech signals. Our algorithm merges `-basis-pursuit with ideas based on the degenerate unmixing estimation technique (DUET) [1]. There are two main novel components to our approach...

متن کامل

Underdetermined Sparse Blind Source Separation with Delays

In this paper, we address the problem of under-determined blind source separation (BSS), mainly for speech signals, in an anechoic environment. Our approach is based on exploiting the sparsity of Gabor expansions of speech signals. For parameter estimation, we adopt the clustering approach of DUET [19]. However, unlike in the case of DUET where only two mixtures are used, we use all available m...

متن کامل

A Novel Blind Source Separation Method with Observation Vector Clustering

We propose a new method for separating sparse signals from their mixtures. Separation is achieved by clustering the normalized observation vectors and extracting each cluster as each separated signal. We show the practical result of speech separation with non-linear sensor arrangements in both a determined and an underdetermined scenarios. We also consider the musical noise problem and show the...

متن کامل

8 The DUET Blind Source Separation

This chapter presents a tutorial on the DUET Blind Source Separation method which can separate any number of sources using only two mixtures. The method is valid when sources are W-disjoint orthogonal, that is, when the supports of the windowed Fourier transform of the signals in the mixture are disjoint. For anechoic mixtures of attenuated and delayed sources, the method allows one to estimate...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007